Feat/kurganov shared memory #117

CyprienBosserelle · 2025-05-25T08:39:25Z

Test Jules coding assistant in optimising using shared memory

…to enhance their performance. Here's a summary of the changes: I've optimized six CUDA device functions: - updateKurgXGPU - updateKurgYGPU - updateKurgXATMGPU - updateKurgYATMGPU - AddSlopeSourceXGPU - AddSlopeSourceYGPU The main idea was to use a faster type of memory called shared memory. Here’s how I approached it: 1. I looked for parts of the code where a thread needs data from its neighbors. 2. For these parts, I set up shared memory to hold the data temporarily (e.g., for `XEv.h`, `XEv.zs`, `XGrad.dhdx`, `Patm`, `XFlux.Fqux`, etc.). 3. At the beginning of each kernel, I copied the necessary data from the main global memory into these shared memory areas, including any extra data needed for calculations involving neighbors. 4. I used a synchronization command (`__syncthreads()`) to make sure all data was loaded into shared memory before any calculations began, and also before writing any updated shared data back to the main memory. 5. I updated the calculations to use the data in shared memory instead of directly accessing global memory. 6. Finally, where needed, I copied the results from shared memory back to global memory (for example, with the flux arrays in the `AddSlopeSource*` kernels). I used some predefined constants (`STATIC_MAX_BLOCK_X = 16`, `STATIC_MAX_BLOCK_Y = 16`, `SHARED_MEM_HALO_WIDTH = 1`) to set the size of the shared memory arrays. I also added some checks to prevent errors if the way the kernels are launched doesn't quite match these predefined sizes. The logic for handling boundaries between different blocks of data (like when blocks have different levels of detail) still uses direct access to global memory. These optimizations apply to both single and double-precision versions of the kernels. The goal is to reduce the time spent waiting for data from global memory, which should make the kernels run faster.

CyprienBosserelle added 30 commits September 25, 2024 09:56

add structure for parallel process of rivers

3e17570

try to build new array to parallellise rivers

5918ee6

Fix some compile issue

3011e4d

Fix compile issue

65c82b6

Fix code and fill Xriver array

6c7489d

fix allocation blunder

c5c3cbe

Add functions for Pinned memory

e058769

Fix Compile paged mem

9cb83cb

Make test for paged mem

1b6c499

tweak test and remove out to screen

9fa3614

modify test for Pin Meme for non-GPU

00f02f3

Fix CPU only MappedMem

9c59e3e

add allocations of river info in GPU XModel

236b679

Add missing variable but also added template to various classes

b8e8ed9

Fix compile issues

ef90c69

fix map mem alloc

31ab89f

Add momentum adjustment when using rain

b0576b1

Add limiter for flux adjustment for dry cells

a957e97

playing up with velocity sanity

1c7018f

Add explanation to new algo

56bfaac

Revert experiment changes on roughness

306cc11

Fix Dynamic forcing

bda4751

force sane velocity

a835af5

Fix zsoffset

9ba730a

Clean <<< ...>>>

ad71ffa

Adding record of timestart

db30cd0

Update Makefile

8aa4c77

ad bnd filter and relax time to input param

50c7238

apply tapering on non-uniform bnd

385fcdc

Update InitialConditions.cu

e499aad

CyprienBosserelle and others added 8 commits April 29, 2025 14:31

add "all" and "file" as keyword for boundary forcing side

2942cc3

add sm50 to Makefile for auto test

83ae6ad

Update Param.h

a4c4094

add switch to enforce mass conservation in timestep selection

8ae704d

add mass conservation forcing as switch to param file

cc07697

Update ReadInput.cu

55771dc

Update Param.h

530c046

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat/kurganov shared memory #117

Feat/kurganov shared memory #117

Uh oh!

CyprienBosserelle commented May 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Feat/kurganov shared memory #117

Are you sure you want to change the base?

Feat/kurganov shared memory #117

Uh oh!

Conversation

CyprienBosserelle commented May 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant